We propose a novel method for temporally pooling frames in a video for the task of human action recognition. The method is motivated by the observation that only a small number of frames together contain sufficient information to discriminate the action class present in a video from the rest. The proposed method learns to pool such discriminative and informative frames, while discarding the majority of non-informative frames, in a single temporal scan of the video. Our algorithm does so by continuously predicting the discriminative importance of each video frame and subsequently pooling the frames in a deep learning framework. We show the effectiveness of the proposed pooling method on standard benchmarks, where it consistently improves over baseline pooling methods with both RGB- and optical-flow-based convolutional networks. Further, in combination with complementary video representations, we report results that are competitive with the state of the art on two challenging, publicly available benchmark datasets.
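As a rough illustration of the kind of importance-weighted temporal pooling the abstract describes (not the paper's actual architecture), the sketch below scores each frame's features with a hypothetical linear scorer, normalizes the scores with a softmax, and uses them to weight the pooled video descriptor, so that frames predicted to be more discriminative contribute more. The scorer parameters `w` and `b` are illustrative stand-ins for whatever importance predictor the full model learns.

```python
import numpy as np

def importance_weighted_pool(frame_features, w, b):
    """Pool a (T, D) array of per-frame features into one D-dim descriptor.

    A hypothetical linear scorer assigns each frame a scalar importance;
    softmax normalization turns the scores into pooling weights, so
    non-informative frames are effectively discarded.
    """
    scores = frame_features @ w + b            # (T,) raw importance scores
    scores = scores - scores.max()             # shift for numerical stability
    weights = np.exp(scores) / np.exp(scores).sum()  # (T,) sums to 1
    return weights @ frame_features            # (D,) weighted average

# Toy example: 5 frames with 4-dimensional features.
rng = np.random.default_rng(0)
feats = rng.normal(size=(5, 4))
w, b = rng.normal(size=4), 0.0
pooled = importance_weighted_pool(feats, w, b)
```

In a learned setting the weights would be produced online, frame by frame, rather than from the whole clip at once; the sketch only shows the weighting-and-pooling step itself.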